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Abstract 

Testable weighted graph parameters and equivalent notions of testability are investi- 
gated based on [4] . We prove that certain balanced minimum multiway cut densities 
are testable. Using this fact, quadratic programming techniques are applied to ap- 
proximate some of these quantities. The problem is related to cluster analysis and 
statistical physics. Convergence of special noisy graph sequences is also discussed. 
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1 Introduction 



A typical problem of contemporary cluster analysis is to find relatively small 
number of homogeneous groups of data that do not differ significantly in size. 
To make inferences on the separation that can be achieved for a given number 
of clusters, some types of minimum cut densities are investigated. 

In a fairly general setup of [J], the objects to be classified are vertices of a 
weighted graph whose edges and vertices both have nonnegative, real weights. 
Edge-weights are similarities between the vertices normalized in such a way 
that is the minimum and 1 is the maximum similarity, while vertex-weights 



* Corresponding author. Research supported in part by the Hungarian National 
Research Grants OTKA 76481 and OTKA-NKTH 77778. 

Email addresses: marib@math.bme .hu (Marianna Bolla), koitomi@math.bme.hu 
(Tamas Koi), kramli@informatika.ilab.sztaki.hu (Andras Kramli). 



Preprint submitted to Elsevier 



11 January 2010 



reflect individual values of the vertices. Classical (simple) graphs have vertex- 
weights all equal to 1 and edge- weights or 1. 

For given number n of vertices and for a fixed integer < q < n we define 
three types of minimum g-way cut densities, each being the minimum of the 
weight-sum of between-cluster edges, occasionally adjusted with a factor char- 
acterizing within-cluster densities, over all or over balanced g-partitions of the 
vertices. The limit of these densities is considered as n — > oo. If this limit ex- 
ists for any convergent graph sequence, we say, that the g-way cut density in 
question is a testable graph parameter. In fact, the subsequent terms of such 
a convergent graph sequence (G n ) become more and more similar in their 
global structure, which fact can be formulated in terms of convergence of the 
homomorphism densities of injective maps F — > G n for any simple graph F. 

Hence, testable parameters measure statistical properties of a large graph 
that are indifferent to minor changes in the edge- and vertex-weights. It will 
be proved that certain balanced g-way cut densities are testable. To this end, 
notions of testability are extended to weighted graphs, and we prove equivalent 
statements of testability by means of large deviation results of Lovasz and 
coauthors jl]. Roughly speaking, these propositions state that if a smaller 
simple graph is selected - by an appropriate randomization - based on a large 
weighted graph, the testable parameter of the randomized one is very close to 
that of the whole graph with high probability. 

The organization of the paper is as follows. In Section 121 notion of a con- 
vergent graph sequence and that of a graphon is introduced based on [I]. 
In Section [3j equivalent statements of testability are discussed for weighted 
graphs. In Section HJ testability of different kinds of minimum multiway cut 
densities is investigated. For non testable ones counterexamples are presented, 
while for testable ones theorems of [5] based on statistical physics are applied. 
In Section [5J continuous extensions of testable weighted graph parameters to 
graphons are constructed that gives rise to a quadratic programming task. In 
Section [61 special graph sequences (blown up structures burdened with a very 
general kind of noise) are analyzed utilizing the fact that the cut-norm of a 
so-called Wigner-noise tends to zero as its size tends to infinity. 



2 Preliminaries 

Let G = G n be a weighted graph on the vertex set V(G) = {1, . . . , n) = [n] 
and edge set E{G). Both the edges and vertices have weights: the edge-weights 
are pairwise similarities = (3ji G [0, 1], i,j G [n], while the vertex- weights 
«i > (i G [n]) indicate relative significance of the vertices. It is important 
that the edge- weights are nonnegative (zero means no connection at all), the 
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normalization into the [0,1] interval is for the sake of treating them later as 
probabilities for random sampling. Let Q denote the set of all such weighted 
graphs. 

The volume of G G Q is defined by «g = X)£=i a ii while that of the vertex- 
subset T by a T = Y,i£T a i- Further, 

e G (S, T) = J2Y1 a sa t f3 s t 

denotes the weighted cut between the (not necessarily disjoint) vertex-subsets 
S and T. 

Lovasz and coauthors [I] define the homomorphism density between the simple 
graph F (on vertex set V(F) = [k]) and the above weighted graph G. With 
the notations 

k 

a * = II a *W' i n j$(^j G) — \\ 

i=l ijeE(F) 

the homomorphism density between F and G is defined by 

t(F,G) = -\ k £ a*.inj s (F,G). (1) 

\ a G) $; V (F)^V(G) 

For a simple graph G, t(F, G) is the probability that a random map V(F) — > 
V(G) is a homomorphism. Similarly, t in j(F,G) and ti n d(F,G) are defined in 
such a way that for a simple G, they are the probabilities that a random 
injective map V(F) — > V(G) is adjacency preserving and results in an induced 
subgraph of F in G, respectively. With the notation 

ind$(F, G) = Yi II (1 — 

ijGE(F) ijeE(F) 

let 

t inj (F, G) := J! ' m j$(^ G ) 

and 

t ind (F,G) := — l — ]T a$ • ind$(F, G), 

where (a)^ denotes the fcth elementary symmetric polynomial of oti, . . . ,a n . 
Latter one resembles to the likelihood function of taking a sample - that is a 
simple graph on k vertices - from the weighted graph G in the following way: 
k vertices are chosen with replacement with respective probabilities ctijctG 
(i = 1, . . . , n). Given the vertex-subset {$(1), . . . , $(&)}, the edges come into 
existence conditionally independently, with probabilities of the edge-weights. 
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Such a random graph is denoted by £(k, G). Obviously, 



P(£(fc, G) — F) — Y, «*md$(F, G), 

\ a G) <f> : v(F)^V(G) 



since we may get back F, even if $ is not injective. As most maps into a 
large graph are injective, the above probability is very close to t [Q d(F, G), and 
t(F, G) is very close to t in j(F, G). Further, t in d(F, G) has a well-defined relation 
to t in j(F, G) that will be formulated in Section 3. In the sequel only the k <Cn 
case makes sense, and this is the situation we need: k is kept fixed, while n 
tends to infinity. 

Definition 1 We say that the weighted graph sequence (G n ) is (left-) convergent, 
if the sequence t(F, G n ) converges for any simple graph F (n — > oo). 

As other kinds of convergence are not discussed here, in the sequel the word 
left will be omitted, and we simply use convergence. 

Authors in [I] also construct the limit object that is a symmetric, bounded, 
measurable function W : [0, 1] x [0, 1] — > K. and they call it graphon. Let 
W denote the set of these functions. The interval [0,1] corresponds to the 
vertices and the values W(x,y) = W(y, x) to the edge-weights. In view of the 
conditions imposed on the edge- weights, the range is also the [0,1] interval. The 
set of symmetric, measurable functions W : [0, 1] x [0, 1] — >■ [0, 1] is denoted 
by VV[o,ii- The stepfunction graphon Wq £ VVfai] is assigned to the weighted 
graph G G Q in the following way: the sides of the unit square are divided 
into intervals Ji, . . . , I n of lengths ai/ac • • • , oc n /aa, and over the rectangle 
Ii x Ij the stepfunction takes on the value fy. 

The so-called cut- distance between the graphons W and U is 

5 n (W,U) =inf \\W-U u \\ n (2) 

where the cut-norm of the graphon W is defined by 



\W\\n = sup 

S,TC[0,1] 



// W(x 1 y)dxdy 

J J SxT 



and the infimum in ()2]) is taken over all measure preserving bijections v : 
[0, 1] — > [0, 1], while U u denotes the transformed U after performing the same 
measure preserving bijection v on both sides of the unit square. An equiva- 
lence relation is defined over the set of graphons: two graphons belong to the 
same class if they can be transformed into each other by a measure preserving 
map, i.e., their <5n-distance is zero. In the sequel, we consider graphons mod- 
ulo measure preserving maps, and under graphon we understand the whole 
equivalence class. By Theorem 5.1 of [9], the classes of W[o,i] form a compact 
metric space with the <5 n metric. 



4 



We will intensively use the following reversible relation between convergent 
weighted graph sequences and graphons. 

Theorem 2 ( Corollary 3.9 of j^j). For any convergent sequence (G n ) of weighted 
graphs with uniformly bounded edge-weights there exists a graphon such that 
8n(Wc n , W) 0. Conversely, any graphon W can be obtained as the limit of 
a sequence of weighted graphs with uniformly bounded edge-weights. The limit 
of a convergent graph sequence is essentially unique: If G n — > W , then also 
G n — > W for precisely those graphons W for which ^□(W 7 , W) = 0. 

Authors of [4j also define the fc-distance of two weighted graphs and that of 
a graphon and a graph. Without going into details, we just cite the following 
facts: for the weighted graphs G, G', and for the graphon W 

5 n (G,G')=5 n (W G ,W G ,) and S n (W, G) = S n (W, W G ). 

They prove (Theorem 2.6) that a sequence of weighted graphs with uniformly 
bounded edge-weights is convergent if and only if it is a Cauchy sequence in 
the metric 8u- 

A simple graph on k vertices can be sampled based on W in the following way: 
k uniform random numbers, X\, . . . , X^ are generated on [0,1] independently. 
Then we connect the vertices corresponding to X,i and Xj with probability 
W(Xi,Xj). For the so obtained simple graph £(k, W) the following large de- 
viation result is proved. 

Theorem 3 (Theorem 4-7 of ^j, part (ii)). Let k be a positive integer and 
W E VV[o,i] be a graphon. Then with probability at least 1 — e~ k ^ 2lo ^ k \ we 
have 

5 u {W,i{k,W))<^=. (3) 
V lo g2 k 

Fixing k, the inequality (jSJ) holds uniformly for any graphon W G W[o,i], 
especially for W G - Further, the sampling from W G is identical to the previously 
defined sampling with replacement from G, that is £(k, G) = £(k, W G ). In fact, 
this argument is relevant in the k < |V(Cr)| case. 



3 Testable weighted graph parameters 

A function / : G — > M is called a graph parameter if it is invariant under iso- 
morphism. In fact, a graph parameter is a statistic evaluated on the graph, and 
hence, we are interested in weighted graph parameters that are not sensitive 
to minor changes in the weights of the graph. 
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The testability results of [I] for simple graphs remain valid if we consider 
weighted graph sequences (G n ) with no dominant vertex-weights, that is 

Oii(G n ) 

max > 0, n — > oo. 

Definition 4 ^4 weighted graph parameter f is testable if for every e > 
t/tere is a positive integer k such that if G e Q satisfies 

a t (G) 1 
max < — , 

i OLG k 

then 

n\f(G)-fm,G))\>e)<e, (4) 

where £(k,G) is a random simple graph on k vertices selected randomly from 
G with replacement as described in Section 2. 



Consequently, such a graph parameter can be consistently estimated based on 
a fairly large sample. As the randomization depends only on the ai(G)/ac ra- 
tios, it is not able to distinguish between weighted graphs whose vertex-weights 
differ only in a constant factor. Thus, a testable weighted graph parameter is 
invariant under scaling the vertex-weights. Now, we introduce some equiva- 
lent statements of the testability, indicating that a testable parameter depends 
continuously on the whole graph. This is the generalization of Theorem 6.1 
of [1] applicable for simple graphs. 

Theorem 5 For the weighted graph parameter f the following are equivalent: 

(a) f is testable. 

(b ) For every e > there is a positive integer k such that for every weighted 
graph G G Q satisfying the node- condition maxj ai(G)/aa < 

|/(G)-E(/(e(fc,G)))|<e. 

(c) For every convergent weighted graph sequence (G n ) with max, di(G n ) / ac n — > 
0, f{G n ) is also convergent (n—^-oo). 

(d) f can be extended to graphons such that the graphon functional f is continu- 
ous in the cut-norm and f '(Wg„) — f \G n ) — > 0, whenever maxj ai(G n ) / «g„ — > 
(n — > oo). 

(e) For every e > there is an Eq > real and an n > integer such that if 
G\,G 2 are weighted graphs satisfying ma.XiCti(Gi)/ ad < l/n 0; max, aj(G 2 )/aG 2 < 
l/n , and 5 D (G 1 ,G 2 ) < e , then - f(G 2 )\ < e. 

To prove the theorem we need three lemmas that are partly generalizations of 
results in [1] stated for simple graphs. 
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Lemma 6 If (G n ) is a weighted graph sequence with no dominant vertex- 
weights, then for any simple graph F 

\t(F, G n ) - t inj (F, G n )\ 0, n ->• oo. 



PROOF. The difference between between t(F,G n ) and ti n j(F,G n ) is essen- 
tially obtained by the summation in ([1]) over the non injective maps. As the 
sum is increased if we take the non- zero fy's 1, it suffices to prove that 

<E> non inj. O n 

where the left hand side is the probability that there are repetitions in the 
vertices. As maxj cti{G n ) / ac n < £ n , this probability is less than 

1 - 1 • (1 - e n ) ... (1 - (k - IK) < 1 - (1 - (k - l)e n ) k 

that tends to 0, since e n — > 0, provided k is fixed. 



Lemma 7 

t inj (F,G)= J2 tUF',G) and t ind {F, G) = £ (-l)^ F '^ F \ nj (F' , G) 

F'^F F'DF 

where F' is a simple super-graph of F (on the same vertex-set, and edge-set 
containing the edge-set of F). 



PROOF. To verify the first statement, it suffices to prove that for any injec- 
tive map $ : V(F) -»• V(G), 

inj,(F,G)= E ind $ (F',G). (5) 

F'DF 

Suppose that £ edges are missing from F to be a complete graph. As F' D F is 
a super-graph of F, it can easily be seen that ind$(F', G) contains the multi- 
plicative factor inj $ (F, G). Hence, the right hand side of © can be factorized 
as inj$(F, G) ■ Si, where Si depends on F, G, and $, but for the sake of sim- 
plicity we omit these ones. We show - by reverse induction on the number 
of edges - that Se = 1. If F is a complete graph on k vertices, then by the 
definition of inj$ and ind$, Sq = 1. If j edges are missing from F to be a 
complete graph, denote by (3i, . . . , (3j the weights of their $-images. Further, 
denote by a = (a±, . . . , <7j) a {0,1} sequence of length j and af\x) := x, 

ap{x) :— 1 — x, % = 1, . . . ,j. With this notation 5^ = J2ae{o,iy Ui=i a i(Pi)- 
Coupling the sequences which differ only in the first coordinate, Sj reduces to 
5j_x, etc. 
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By inclusion-exclusion, the second statement also follows. 

Lemma 8 (Lemma 5.3 of UV). Let (G n ) be a sequence of weighted graphs with 
uniformly bounded edge-weights, and no dominant vertex- weights. If5o{U, Wa n ) — > 
for some U G W, then the graphs in the sequence (G n ) can be relabeled in 
such a way that the resulting sequence (G' n ) of labeled graphs converges to U 
in the cut-norm: II U — Wn> II n 0. 

Now, we are able to prove the main theorem (Theorem [5]). 

PROOF. The idea of the proof is analogous to that of Theorem 6.1 of [I]. 
First we prove that (a),(b),(c),(e) are equivalent: 

(a) =>- (6): The statement is obvious, as due to the boundedness of /, (TjJ 
implies that the difference is small on average. 

(6) =>- (c): Let (G n ) be a convergent sequence of weighted graphs with no 
dominant vertex-weights. Let e > be arbitrary, and k is chosen corresponding 
to e as in statement (b). If n is large enough, then \f{G n )— E(/(£(/c, G n )))\ < e. 
On the other hand, by the definition of convergence it follows that t(F, G n ) is 
convergent for all simple graphs F on k vertices. Using Lemmas £md(F, G n ) 
tends to a limit value denoted by ti n d(F). This means that P G n ) — F) — > 
iind(F) and so 

E(/(£(fc, G n ))) £ t ind (F) ■ f(F) = a k , 

F: \V(F)\=k 

since the number of simple graphs on k vertices is finite. In summary, 

\f(G n ) -a k \< \f(G n ) - E(/(£(*, G n )))\ + |E(/(e(fc, G n ))) -a k \<2e 
provided n is large enough. 

(c) =>■ (e): Suppose that (e) does not hold. In this case there exist e > 0, further 
sequences (G n ) and (G n ) of weighted graphs, such that the dominant vertex- 
weights of both sequences tend to 0, 5a(G n , G n ) — > 0, and \f(G n ) — f(G n )\ > e. 
Using the compactness of W[o,i] we can assume that both sequences are con- 
vergent. For this reason, the merged sequence Gi, G 1 , G 2 , G' 2 , ■ ■ ■ is also conver- 
gent. For the above merged sequence, by (c), the sequence f(G\), /(Gj), fiGz), f(G' 2 ), . . . 
is covergent, that contradicts to \f(G n ) — f(G n )\ > e. 

(e) =>■ (a): Suppose that (a) does not hold. In this case there exist e > 
and a sequence (G n ) such that maxj "^"^ < ^, and with probability at least 
e the inequality \f(G n ) — f(£(n,G n ))\ > e holds for all n. To this e choose 
the corresponding n and Eq as in the statement (e). Furtheron, because of 
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(|3]), the sequence #n(G n , £(n, G n )) tends to in probability. In particular, 
P (Sa(G n , £(n, G n )) < e ) > 1 — |. Using the definition of e and n we get 
that F(\f(G n ) - f{£(n,G n ))\ < e) > 1 - §. This contradicts to the fact that 
with probability at least e the opposite is true. 

Now we prove that the statement (d) is also equivalent to the testability. 

(c) , (e) =>- (d): Let W G W[o,i] be an arbitrary graphon. By Theorem [2] we 
can find a sequence (G n ) of weighted graphs with no dominant vertex-weights 
tending to W. Let f{W) be the limit of f{G n ). Because of (c) the limit exists, 
and due to the statement (e) this definition is correct. First we prove the 
continuity. Let e > be arbitrary. Using the statement (e), to | we assign the 

corresponding e and n . We show, that \\W — W'\\a < — implies \f(W) — 
f(W ) | < e. For this purpose let (G n ) be a sequence of weighted graphs with no 
dominant vertex-weights tending to W. We can choose a G from (G n ) such that 

the dominant vertex- weight of G is smaller than further, 5q(G, W) < |- 
and \ f(G) — f(W)\ < |. Similarly, we can choose a G' from the sequence (G' n ) 
tending to W with analogous properties. In this case 5n(G, G') < S\j(G, W) + 
5 D (W,W') + 5 D (W',G') < e. By (e), \ f(G) - f(G'\ < |, and hence, 

\f(W) - f(W')\ < \f(W) - /(G) I + \f(G)\ - f(G')\ + |/(G') - f(W')\ < e. 

It remains to show that f{Wa n ) — f{G n ) — > 0, whenever |y(G n )| — > oo 
with no dominant vertex-weights. On the contrary, suppose that there exists 
a sequence (G n ) with no dominant vertex- weights such that f{Wc n ) — f{G n ) 
does not tend to 0. For the sake of simplicity we can assume that for some 
e '- f(Wa n ) — f(G n ) > e for all n. We can also assume that (G n ) converges 
to some graphon W in the <5n metric. By Lemma there is a sequence (G' n ) 
isomorphic to {G n ) such that \\W G > — W\\a —> 0. Using the statement (c), 

lim n ^oo f(G n ) = lim^oo f(G' n ) = fiW). In addition, / is continuous, and 
for this reason, lining j{W Gn ) = linv^ f{W G >J = f{W). But this is a 
contradiction. 

(d) =^ (c): Let (G n ) be a convergent sequence of weighted graphs with no 
dominant vertex-weights. Let W be its limit. So So(WG n , W) —> 0. In this way, 
by Lemma [HI we can relabel (G n ) into (G n ) in such a way that \\W G > — W\ \u — > 

0. Therefore, using the continuity of / we get f(W G > ) — f(W) —> 0. Since 

f{G n ) — f(W G > ) = }'{G n ) — f(Wc n ), the last term tends to because of the 
statement (d). Thus, f{G n ) is convergent. 

Remark 9 The original testability theorem for simple graphs in ^ was for- 
mulated in terms of sampling without replacement. In the most important case, 
when the size of the sample is small compared to the size of the underlaying 
graph, the two sampling methods are approximately the same. Usually, this 
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is the case in practical applications. In our definition of the testability of a 
weighted graph parameter we use sampling with replacement, but the testabil- 
ity could be defined by any randomization for which a large deviation result 
similar to that of Theorem^ holds. However, equivalent statements (c), (d), 
(e) do not depend on the randomization, and we may expect their equivalence 
to statemants (a),(b) under an appropriate sampling with likelihood function 
strongly connected to t in( i(F, G) and satisfying |3j). 



4 Balanced multiway cuts 



Lovasz and coauthors [5] proved the testability of the maximum cut density. 
The minimum cut density is somewhat different. E.g., if a single vertex is 
loosely connected to a dense part, the minimum cut density of the whole 
graph is small, however, randomizing a smaller sample, with high probability, 
it will come from the dense part with a large minimum cut density. 

To prove the testability of certain balanced minimum multiway cut densities 
we use the notions of statistical physics in the same way as in [5j. Most of 
these notions are self-explanatory. However, to be self-contained, we included 
some definitions for clarification together with the notion of a factor graph. 

Let G G Q be a weighted graph on n vertices with vertex- weights 
and edge- weights /%'s. Let q < n be a fixed positive integer, and V q denote 
the set of g-partitions P = (Vi, . . . , V q ) of the vertex set V. The non-empty, 
disjoint vertex-subsets sometimes are referred to as clusters or states. The 
factor graph or q- quotient of G with respect to the g-partition P is denoted 
by G/P and it is defined as the weighted graph on q vertices with vertex- and 
edge- weights 

a t (G/P) = ^ (ie[q\) and ^(G/P) = 6g( ^ Vj) (i,je[q\), 
a G a Vi a Vj 

respectively. Let <S q (G) denote the set of all g-quotients of G. The Hausdorff 
distance between S q (G) and S q (G') is defined by 

d m (S q (G),S q (G')) = max{ sup inf d^H, H') , sup inf d x {H,H')}, 

HeS q (G) H' es q (G> ) H , e s 9 {G , ) HdS q (G) 



where 

d 1 (H,H')= £ 

*,ie[g] 



a t (H) aj {H)fiij{H) a t (H>) a 3 {H')^ (H 1 ) 



Ct H OLjji 



+E 



Oi(H) a^H 1 ) 



a H ctw 

is the Zi-distance between two weighted graphs H and H' on the same number 
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of vertices. Here especially, H and H' are factor graphs, and hence, an = 
an' = 1, therefore the denominators can be omitted. 

Given the real symmetric q x q matrix J and the vector h G M. q , the partitions 
P G V q also define a spin system on the weighted graph G. The so-called 
ground state energy of such a spin configuration is 

4(C,J,h) = - max I J2<G/P)h i + £ a i (G/P)a j (G/P)/3 ij (G/P)J ij J . 

Here J is the so-called coupling-constant matrix, where represents the 
strength of interaction between states % and j, and h is the magnetic field. 
They carry physical meaning. We shall use only special J and h, especially 
h = 0. 



Sometimes, we need balanced g-partitions to regulate the proportion of the 
cluster volumes. A slight balancing between the cluster volumes is achieved by 
fixing a positive real number c (c < 1/q). Let V c q denote the set of g-partitions 
of V such that > c i E [q], or equivalently, c < ^ < ^ (i ^ j). 

A more accurate balancing is defined by fixing a vector a — ( Qj\ , . . . , CL„ ) 
with components forming a probability distribution over [q]: a, > i £ [q], 
Sf=i °i — 1- Let denote the set of g-partitions of V such that f-^S • • • , 
is approximately a-distributed, that is 



«G 



< (i = l,...,q), 

OiG 



the right hand side tending to as |V(G)| — > oo for weighted graphs with no 
dominant vertex-weights. 

The microcanonical ground state energy of G given a and J (h = 0) is 

^(G,J) = -max £ a i (G/P)a j (G/P)p ij (G/P)J ij . 

q i,je[q] 

Remark 10 In Theorem 2.14 °f W$ ^ ^ s proved that the convergence of the 
weighted graph sequence (G n ) with no dominant vertex-weights is equivalent 
to the convergence of its microcanonical ground state energies for any q, a, 
and J. Also, it is equivalent to the convergence of its q-quotients in Hausdorff 
distance for any q. 

Remark 11 Under the same conditions, Theorem 2.15 of states that the 
convergence of the above (G n ) implies the convergence of its ground state en- 
ergies for any q, J, and h; further the convergence of the spectrum of (G n ). 
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Using these facts, we investigate the testability of some special multiway cut 
densities defined in the forthcoming definitions. 

Definition 12 The minimum q-way cut density of G is 

H Or i=l j=i+l 

the minimum c-balanced q-way cut density of G is 

f ^ G) = ¥iz ^ £ £> ec{y - v ^ (6) 

re> q a G i=1 j=i+1 
and the minimum a-balanced q-way cut density of G is 

i a G i=i j=i+i 



Occasionally, we want to penalize cluster volumes that wildly differ. For this 
purpose we herein introduce the notions of weighted minimum cut densities. 

Definition 13 The minimum weighted q-way cut density of G is 

9-1 q y 

N(G) = min E — • ^(VuVj), 

PeV i i=l j=i+i a Vi ■ a Vj 

the minimum weighted c-balanced q-way cut density of G is 
K(G)=mmX; t - 1 —-ec<V i ,V j ), 

- rt '? i=lj=i+l U Vi ^Vj 

and the minimum weighted a-balanced q-way cut density of G is 
<(G)=min£ £ -L^.e^V,). 

i i=i j=i+l u Vi «Vj 

Proposition 14 f q (G) is testable for any q < \V(G)\. 



PROOF. Observe that f q (G) is a special ground state energy: 

/ g (G)=4(G,J,0), 

where the magnetic field is and the q x q symmetric matrix J is the following: 
J a = i G [q], further Jij — — 1/2 (i ^ j). By Remark ITUl and the equivalent 
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statement (c) of Theorem [5j the minimum g-way cut density is testable for 
any g. 



However, this statement is of not much use, since f q (G n ) — > 0, in the lack of 
dominant vertex-weights. In fact, the minimum g-way cut density is trivially 
estimated from above by 

f i s~i \ ^ i i \ ^max (Gri) | I q 1 j I Ot max (G n ) ^ 

Jqy^n) S yq — J-J- 



«g„ V 2 J \ a Gn J 

that tends to provided a m£LX (G n )/aG n — > as n — > oo. 

Proposition 15 fq(G) is testable for any q < \V(G)\ and distribution a over 
[?]■ 



PROOF. Choose J as in the proof of Proposition [T^l In this way, f*(G) is a 
special microcanonical ground state energy: 

f*(G) = £*(G,J). (7) 

Hence, by Remark (TUJ the convergence of (G n ) is equivalent to the convergence 
of fq(G n ) for any q and any distribution a over [q]. Therefore, by the equivalent 
statement (c) of Theorem [5J the testability of the minimum a-balanced g-way 
cut density also follows. 

Proposition 16 ffj(G) is testable for any q < |V(G)| and c < 1/q. 



PROOF. Theorem 4.7 and Theorem 5.5 of [5] imply that for any two weighted 
graphs G, G 1 

l4 a (G,J)-4 a (G',J)| < (3/2 + K)-d m (S q (G),S q (G')), (8) 

where k = o(mm{\V(G)\, \ V(G')\}) is a negligible small constant, provided the 
number of vertices of G and G' is sufficiently large. By Remark [TT] we know 
that if (G n ) converges, its g-quotients also converge in Hausdorff distance, 
consequently form a Cauchy-sequence. This means that for any e > there 
is an iVo such that for n,m > N : d m (S q (G n ), S q (G m )) < e. We want to 
prove that for n,m > N : \ f q {G n ) — f q {G m )\ < 2e. On the contrary, suppose 
that there are n,m > N such that \f c q {G n ) - f c q {G m )\ > 2e. Say, f c q {G n ) > 
f q {G m ). Let A := {a : a« > c , i = 1, . . . , g} is the subset of special c-balanced 
distributions over [g]. On the one hand, 

f c q (G m ) = mm f*(G m ) = ff (G m ) 
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for some a* e A. On the other hand, by © and ©, /**(G n ) - ff{G m ) < 
(| + re)£, that together with the indirect assumption implies that f q {G n ) — 
ff(G n ) > (~ — re)e > for this a* G A. But this contradicts to the fact that 
fq(G n ) is the minimum of 's over A. Thus, f q (G n ) is also a Cauchy 

sequence, and being a real sequence, it is also convergent. 

Concerning the penalized densities, trivially, 

it: In. -i . . . -i 

» 1=1 J=l-\-\ 

In fact, n q (G) is not testable as we can show an example where /i q (G n ) — > 0, 
but randomizing a sufficiently large part of G n , the weighted minimum g-way 
cut density of that part is constant. The example is for q = 2 and for a simple 
graph on n vertices such that order of ^/n vertices are connected with a single 
edge to the remaining vertices that form a complete graph. Then p,2(G n ) —> 0, 
but randomizing a sufficiently large part of the graph, with high probability, it 
will be a subgraph of the complete graph, whose minimum 2-way cut density 
is of constant order. 

Proposition 17 /^(G) is testable for any q < \V(G)\ and distribution a over 
[?]■ 

PROOF. By the definition of Hausdorff distance, the convergence of g-quotients 
guarantees the convergence of 

A$(G)= minX: E MG/P) (9) 
fc/ 1 j=i j=t+i 

for any a and q in the following way. Let Sg(G) denote the set of factor graphs 
of G with respect to partitions in V*. As a consequence of Lemma 4.5 and 
Theorem 5.4 of [5], for any two weighted graphs G, G' 

maxd Hf (S*(G),S*(G')) < (3 + re) • d Hf (S q (G),S q (G')), (10) 
where re = o(min{|V(G)|, \V(G')\}). 

By Remark [TUl for a convergent graph- sequence (G n ), the sequence S q (G n ) 
converges, and by the inequality (TTOT) . Sg(G n ) also converges in Hausdorff 
distance for any distribution a over [q]. As they form a Cauchy sequence, Ve 
3iVo such that for n,m > N Q 

d H f(S*(G n ),S»(G m ))<e 
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uniformly for any a. In view of the Hausdorff distance's definition, this means 
that for any g-quotient H G S q (G n ) there exists (at least one) g-quotient 
H 1 G <S q (G m ), and vice versa, for any H' G S q {G m ) there exists (at least one) 
H G <S q (G n ) such that di(H, H') < e. (In fact, the maximum distance between 
the elements of the above pairs is less than e. Note that the symmetry in the 
definition of the Hausdorff distance is important: the pairing exhausts the sets 
even if they have different cardinalities.) 

Using the fact that the vertex-weights of such a pair H es H' are almost the 
same (the coordinates of the vector a), by the notation a = mim^] a iy the 
following argument is valid for n, m large enough: 

2a 2 £ |/%(#) - P tJ (H')\ < J2 * 2 \^{H) ~ /%(#')! < 

%° ^ (11) 

< J2 \^(H)a 3 (H)f3 t ,(H) - a l (H')a j (H')f3 ij (H')\ = d 1 (H,H') < e. 

Therefore 

I E t N{h) - E E W)\ < A ■= ^ 

i=l j=i+l i=l j=i+l Za 

and because X^=i+i Pij(H) and Sf=i Z)j=i+i Pij(H') are individual terms 
behind the minimum in (Q, the above inequality holds for their minima over 
Vg as well: 

\^(G n )-^(G m )\<e'. (12) 
Consequently, the sequence /ig(G n ) is a Cauchy sequence, and being a real 
sequence, it is also convergent. Thus jjft is testable. 

Remark 18 The testability of fi q , apparently, does not follow in the same way 
due to presence of distinct vetex-weights in H and H' . Thus, the smallness of 
di(H,H') does not imply the closeness of their edge-weights. 

However, as the testability of /® implied the testability of the testability 
of ^ also implies the testability of n c q . 

Proposition 19 ^ c q {G) is testable for any q < \V{G)\ and c < 1/q. 



PROOF. The proof is analogous to that of Proposition[16]using equation ( JT2l) 
instead of equation ([8]). By the pairing argument of the proof of Proposition [TTf 
the real sequence n q (G n ) is a Cauchy sequence, and therefore, convergent. This 
immediately implies the testability of \i c q . 

By Remark [TTJ the convergence of {G n ) also implies the convergence of the 
spectra, though the convergence of the spectrum itself is weaker than the 
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convergence of the graph sequence. Without going into details, we remark 
that in [2], f q (G) and n q (G) were bounded from below by the q smallest 
Laplacian eigenvalues of G. An upper estimate can also be constructed and 
we conjecture that in case of testable parameters an asymptotic estimate is 
also valid. 



5 Minimum cut as a quadratic programming problem 



In Section 4, we proved that f£ is a testable weighted graph parameter. Now, 
we extend it to graphons. 

Proposition 20 Let us define the graphon functional f£ in the following way: 



q-l q 



/ 9 c W:=inf££ // W(x,y)dxdy=mif q (W;S 1 ,...,S q ) (13) 

where the infimum is taken over all the c-balanced Lebesgue-measurable par- 
titions Q = (Si, . . . , S q ) of [0,1]. For these, Sf = iA(Si) = 1 and X(Si) > c 
(i G [q]), where A denotes the Lebesgue-measure, and Q c q denotes the set of 

c-balanced q-partitions of [0,1]. We state that f£ is the extension of fg in the 
following sense: If (G n ) is a convergent weighted graph sequence with uni- 
formly bounded edge-weights and no dominant vertex- weights, then denoting 
by W the essentially unique limit graphon of the sequence (see Theorem^, 
f q (G n ) -+ t q {W) asn^oo. 



PROOF. First we show that f° is continuous in the cut-norm. As f q (W) is 
insensitive to measure preserving maps of W, it suffices to prove that to any e 
we can find e' such that for any two graphons W, U with \\W — U\\q < e', the 
relation \f q (W) — f q (U)\ < e also holds. By the definition of the cut-norm, for 
any Lebesgue-measurable g-partition (Si, . . . , S q ) of [0,1], the relation 

(W(x,y)-U(x,y))dxdy\<e' (i ^ j) 
holds. Summing up for the i ^ j pairs 

IE E // W(x,y)dxdy- q j2 E // U(x,y)dxdy\<( q )e'. 

1=1 j=i+l JJ Si*Sj i=1 j =i+ i JJSiXSj \Zy 



(14) 



Therefore 



r5 E E // W(x,y)dxdy> inf £ E fi U(x,y) dx dy-( q )e' 

(Si,..,s,,)es^ j=1 j =i+1 JJSixSj (Si,..,s,)es= j=1 j =i+1 JJSixSj \zj 



16 



and vice versa, 




Consequently the absolute difference of the two infima is bounded from above 
by («)e'. Thus, e' = e/(|) will do. 

Let (G n ) be a convergent weighted graph sequence with uniforly bounded edge- 
weights and no dominant vertex- weights. By Theorem [21 there is an essentially 
unique graphon W such that G n — > W, i.e., So{Wg„, W) — > as n — > oo. By 
the continuity of f q , 

£(WoJ £0*0, n oo. (15) 

Suppose that 

/^ G J = / g (^ Gn ;^,...,5*), 

that is the infimum in ffT3l) is attained at the c-balanced Lebesgue-measurable 
g-partition (S* . . . , S*) of [0,1]. 

Let G* be the g-fold blown-up of G n with respect to (S*, . . . , S*). It is a 
weighted graph on at most nq vertices defined in the following way. Let 
Ii,...,I n be consecutive intervals of [0,1] such that X(Ij) = aj(G n ), j = 
1, . . . , n. The weight of the vertex labeled by ju of G* is X(Ij HS 1 *), u G [q], j G 
[n], while the edge- weights are fij U ,iv{G* nq ) = (3ji(G n ). Trivially, the graphons 
Wg„ and Wg* essentially define the same stepfunction, hence / c (WgJ = 
f c q {W Gu ). Therefore, by (USD, 

jJ(W^J F q (W), n->oo. (16) 
As 5 D (G n , G* nq ) = 5 n (W Gn , W G * q ) = 0, by part (e) of TheoremEJit follows that 
\f c q (G* nq ) - f c q (G n )\ ^ 0, n^oc. (17) 
Finally, by the construction of G* nq , f q {W G * nq ) = f q {G* nq ), and hence, 

\f c q (G n ) - r q {w)\ < \r q {G n ) - r q (G*j\ + \r q (w G *j - r q (w)\ 

that, in view of ( TTB"|) . f[T7|) . implies the required statement. 

Corollary 21 In Section 3, while proving Theorem^ an essentially unique 
extension of a testable graph parameter to graphons was given. By Propo- 
sition^^ the above f q is the desired extension of f q , therefore part (d) of 
Theorem [3] is also applicable to it: For a weighted graph sequence {G n ) with 
maxj "^"^ — > 0, the limit relation f q (WG n ) — fq{G n ) — >■ also holds as 
n — > oo. 
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Corollary [21] gives rise to approximate the minimum c-balanced g-way cut 
density of a weighted graph on "many" vertices with no dominant vertex 
weights by the extended c-balanced g-way cut density of the stepfunction 
graphon assigned to the graph. In this way, the discrete optimization problem 
can be formulated as a quadratic programming task with linear equality and 
inequality constraints. 

To this end, let us investigate a fixed weighted graph G on n vertices (n is 
large). To simplify notation we drop the subscript n, and G in the arguments 
of the vertex- and edge- weights. As f q (G) is invariant under the scale of the 
vertices, we can suppose that «g = J2f =1 a, = 1. As /3y G [0, 1], Wq is uni- 
formly bounded by 1. Recall that Wg(x, y) = if x G Ij, y G Ij, where 
X(Ij) = ctj (j = 1, . . . , n) and Ii, . . . , I n are consecutive intervals of [0,1]. 

For fixed q and c < 1/q, f q (G; V x , ■ ■ ■ , V q ) = £?=i EjU+i e G (V^, V^-) is a 
function taking on discrete values over c-balanced g-partitions P = (Vi, . . . , V q ) G 
Vg of the vertices of G. As n — >■ oo, by Corollary [2U this function approaches 
fqiWc] S\, . . . , S q ) that is already a continuous function over c-balanced q- 
partitions Q = (Si, . . . , S q ) G Q q of [0,1]. In fact, this continuous function can 
be regarded as a multilinear function of the variable 

x = (^11) • • • j Xi n , X21, • • • , X2n, ■ ■ ■ , X q i, . . . , X qn ) T G 

where the coordinate indexed by ij is 

Xij = X(Si n Ij), j = l,...,n; i = l,...q. 

Hence, 

q—l q n n j 

/ g (W G ; 5i, . . . , 5,) = / ? (x) = E E EE Xijxwfe, = -x T (A ® B)x, 

i=i i'=i+i j=i j'=i z 

where - denoting by l qxq and I qxq the g x q all l's and the identity matrix, 
respectively - the eigenvalues of the q x q symmetric matrix A = l qxq — I qxq 
are the number q — l and -1 with multiplicity q — l, while those of the n x n 
symmetric matrix B = are A x > • • • > A n . Latter one being a Frobenius- 
type matrix, Ai > 0. The eigenvalues of the Kronecker-product A®B are the 
numbers (g— l)Aj {i = 1, . . . ,n) and — \ with multiplicity g — 1 {i = 1, . . . ,n). 
Therefore the above quadratic form is indefinite. 

Hence, we have the following quadratic programming task: 

minimize 7g( x ) = -x T (A ® B)x 

q n (18) 

subject to x > 0; E x «i = a i e N)> E- 1 ^' — c (* e M)- 

i=i j=i 
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The feasible region is the closed convex polytope of (1TB]) . and it is, in fact, 
in an n(q — l)-dimensional hyperplane of M. nq . The gradient of the objective 
function V/ 9 (x) = (A ® B)x cannot be in the feasible region, provided the 
weight matrix B, and hence A <g> B is non singular. 

The arg-min of the quadratic programming task (fTHl) is one of the Kuhn- 
Tucker points (giving relative minima of the indefinite quadratic form over 
the feasible region), that can be found by numerical algorithms (by tracing 
back the problem to a linear programming task), see [JJ. 

Eventually, we give the extension of the testable weighted graph parameter \f q 
to graphons. 

Proposition 22 Let us define the graphon functional fi c q in the following way: 

F q {W) := inf EE// women W ^ V) dx d V- 
QzQq^^JJSixSj \{Si)\{Sj) 

We state that Jx q is the extension of fi q in the following sense: If (G n ) is a 
convergent weighted graph sequence with uniforly bounded edge-weights and 
no dominant vertex-weights, then denoting by W the essentially unique limit 
graphon of the sequence (see Theorem^), fj, q (G n ) —> fi q {W) as n — > oo. 

The proof is analogous to that of Proposition [20J, after we have proved that 
jl q is continuous in the cut-norm. In fact, with estimates, analogous to (JHJ), 
e ' = ec 2/(V) w iH do. 

Consequently, p, q (Wa) — Hq(G) — > as V(G) —> oo with no dominant vertex- 
weights. This fact also gives rise to approximate the minimum c-balanced 
weighted g-way cut density of a large graph by quadratic programming meth- 
ods. 



6 Convergence of noisy graph sequences 

Now, we use the above theory for perturbations. If not stated otherwise, the 
vertex- weights are equal (say 1), and a weighted graph G on n vertices is 
identified with its n x n symmetric weight matrix A. Let Ga. denote the 
weighted graph with unit vertex-weights and edge-weights that are entries of 
A . 

Definition 23 Let (1 < % < j < n) be independent random variables 
defined on the same probability space, and Wji = Wij. E(wjj) = (Vz, j) and 
the Wij's are uniformly bounded, i.e., there is a constant K > - that does 
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not depend of n - such that \wij\ < K , The nxn symmetric real random 

matrix W = (wij)i<i<„, i<j<„ is called a Wigner-noise. 

The corresponding edge- weighted graph G w is called a Wigner- graph. To in- 
dicate that the size n is expanding, we use the notations W n and Gw„. 

Definition 24 The nxn symmetric real matrix B is a blown-up matrix, if 
there is a q x q symmetric so-called pattern matrix P with entries < p i3 < 1, 
and there are positive integers rii, . . . , n q with X)f=i n i — n > suc h that - after 
rearranging its rows and columns - the matrix B can be divided into q x q 
blocks, where block is an ni x rij matrix with entries all equal to pij 



Fix P, blow it up to an n x n matrix B n , and consider the noisy matrix 
A n = B„ + W n as ni, ■ ■ ■ , n q — > oo at the same rate. 

Remark 25 While perturbing B n by W n; for the uniform bound of the entries 
o/W n the condition 



is satisfied. In this way, the entries of A n are in the [0,1] interval, and hence, 



We remark that Gw„ 4- @> but WG Wn G W and the theory of bounded graphons 
applies to it. By adding an appropriate Wigner-noise to B n , we can achieve 
that A n becomes a 0-1 matrix: its entries are equal to 1 with probability p^ 
and otherwise within the block of size ni x rij ( after rearranging its rows and 
columns). In this case, the corresponding noisy graph Ga„ is a random simple 
graph. 

As rank (B„) = q and ||W n || = 0{y/n) almost surely (n — > oo), the noisy 
matrix A n almost surely has q protruding eigenvalues (of order n), and all 
the other eigenvalues are of order y/n, there is a spectral gap between the q 
largest and the other eigenvalues A n . 

Let X n = (xi, . . . , x 9 ) be the n x q matrix containing the eigenvectors belong- 
ing to the q protruding eigenvalues of A n in its columns. The rows of X n , that 
is the vectors x 1 , . . . ,x n e W are regarded as g-dimensional representatives 
of the vertices of Ga„ • The q- variance of the representatives is 



(1 <i,j <n). 



K < min{ min p^ , 1 — max 



(19) 



i 



s 2 q (* n ) = E E ll xJ 



- X 




i=i jeVi 



where x* = — 
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In [3] we proved that 

1 



^(Xn) = O 

almost surely, under the growth condition rii/n > c (i — 1, . . . , q). 

In the other direction: for sufficiently large n, under some conditions, we can 
separate an nxn symmetric "error-matrix" E from A, such that ||E|| = 0(y/n) 
and the remaining matrix A — E is a blown- up matrix B of "low rank". 
Consequently, Gb is a weighted graph with homogeneous edge-densities within 
the clusters (determined by the blow-up). It resembles to the weak Szemeredi- 
partition, cf. [9], but the error-term is bounded in spectral norm, instead of 
the cut-norm. However, by large deviations, we can prove that the cut-norm 
of a Wigner-graph tends to zero almost surely as n — > oo. 

Theorem 26 For any sequence (GwJ of Wigner-graphs 

lim ||Wg w J|n = (n -> oo) 

almost surely. 

To prove the theorem, we need a proposition that is an easy consequence of 
Azuma's martingale inequality, see Theorem 5.3 of [6]. 

Proposition 27 Let Xi, X 2 , . . . be i.i.d. random variables with zero mean 
and < 1, i — 1, . . . , n. Then 



P 



N 



> 7iV I < 2exp (-—L) , 0<7<1. (20) 



PROOF. Now we are ready to prove Theorem [26j By the definition of the 
cut-norm of a stepfunction graphon and [8], 



i 



Gw„ II □ 



max 



n 2 U,Tc\n 



EE 

ieUjeT 



w 



'J 



(21) 



We remark that maxrj iTc [ n] T,ieuT,jeT w ij 
defined in [7]. 



is the cut-norm of the matrix W r 



To make the entries behind the double sum of ( 1211) independent, we use for- 
mulas (7.2), (7.3) of [I]: 



||W Gw J| n < 6 max -2 



1 

uc[n] n 2 



E E 

i£U j£[n]\U 



W; 



(22) 
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Apply Proposition |27] for a subsequence of length N of entries of W n which 
does not contain Wij and Wji simultaneously. Namely, i G U, j G [n] \ U, 
N=\U\-(n- \U\). Remark that n-l< N < n 2 /A. 

We distinguish between two cases. 

• Case 1. Suppose that iV < n 3//2 and apply Proposition [23 for the right hand 
side of (EU) with 7 = n 4 / 10 : 



P I 

n z 



E E w n 


> n- 1 ^ 


= P ( 


E E w n 




ieu je[n]\u 






ieu je[n]\u 





< 



E E 

i&J j€[n]\U 



W 



13 



> 



(23) 

In the first inequality we used the condition iV < n 3//2 , while the second 
inequality follows from (120]) and the fact that N > n — 1. 
Case 2. Now suppose that iV > r?l 2 and apply Proposition [27] for the right 
hand side of (E2J with 7 = An- 1 / 10 : 



P 



E E 

ieu je[n]\u 



W; 



> n 



-1/10 



P 



E E 



Wi 



E E 

ieUje[n]\u 



w 



> 



ANn^ 10 



i£U j£[n]\U 

< 2 exp (-8n 13/10 



> rSn- 1 ' 10 < 



(24) 

In the first inequality we used the fact AN < n 2 , while the second inequality 
follows from ( 120]) and the condition iV > n 3 ^ 2 . 



is greater than that of 
> n -1 / 10 ) can be bounded by 



As for large values of n the right hand side of 
the probability F (max UC [ n] J2i<=uJ2je[ n ]\u w ij 
the number of possible 2-partitions of [n] times the right hand side of (]2 

P (\\Wa Wn h > 6 ' (^r 1/10 ) < 2" +1 exp (-8n 13 / 10 ) . 



(25) 



As the right hand side of f]2"5"]) is a general term of a convergent series, the 
statement of the theorem follows by the Borel-Cantelli Lemma. 

Remark 28 Let A n := B n + W n and ni, . . . ,n q — > 00 in such a way that 
lim^oo ^ = Ti (i = 1, . . . , q), n = J2i=i n ii further, for the uniform bound 
K of the entries of the "noise" matrix W n the condition U~9\) is satisfied. 
Under these conditions, Theorem [2E\ implies that the "noisy" graph sequence 
(Ga„) C Q converges almost surely in the <$□ metric. It is easy to see that the 
almost sure limit is the stepfunction Wh, where the factor graph H = Gb,JP 
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does not depend on n, as P is the q-partition of the vertices of G& n with 
resepect to the blow-up (with cluster sizes n\, . . . , n q ). Actually, the vertex- 
and edge-weights of the weighted graph H are 



a l {H)=r l (ie[q}), fa(H) = =^ = Pij G [q]). 

Remark 29 Under the conditions of Remark H3, as (G An ) C Q converges 
almost surely and f q , f°, /*, \f q , fi* are testable graph parameters, the se- 
quences f q (G An ), f q (G An ), fg(G An ), /J,q(G An ), Vq(G An ) also converge almost 
surely. However, the almost sure limits are the corresponding extended f q - 
or p, q -values of the graphon Wh and not the f q - or \i q -values of H . For 
example, in Section 4, we have shown that f q (G An ) — > (n — > oo), but 

f q (H) = Et 1 iEU + iPv^°- 
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